Hourly AQI Monitoring Data

Source URL:  http://pan.baidu.com/s/1gd8GUxt#list/path=%2F

Source affiliation:   beijingair.sinaapp.com

------------------------------------
License: free for academic research, no commercial use, resale, or redistribution permitted.
------------------------------------

Published: Dec 2016
Editor: Lex Berman
CHGIS, Center for Geographic Analysis

Email: chgis@fas.harvard.edu
------------------------------------

Distribution URL: https://dataverse.harvard.edu/dataverse/beijing-air

Character set encodings:  
   UTF8  (https://en.wikipedia.org/wiki/UTF-8)


ABSTRACT:   AQI air quality observations from 1497 ground monitoring stations in China are collected hourly and aggregated into weekly observation files (.csv format).  The AQI values are keyed to station IDs, and each station has mappable x, y coordinates in a separate stations table.

STATION LOCATIONS:

The station locations file (1497 stations) has been updated with estimated X, Y positions for the 15 rows that were missing those values.   In those rows, the additional [note] field is set to "XY (Lex)" to mark the coordinates as added estimates.

The field names were also translated to ASCII:

监测点编码	StationID
监测点名称	StationNM
城市	CityNM
经度	Long
纬度	Lat
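
The field-name translation above can be applied programmatically. The following is a minimal sketch, assuming the station table is read as comma-separated text with the original Chinese header row. The function name and the one-row sample (station "X001" and its values) are hypothetical placeholders, not data from the actual file.

```python
import csv
import io

# Mapping copied from the field list above (original name -> ASCII name).
FIELD_NAMES = {
    "监测点编码": "StationID",
    "监测点名称": "StationNM",
    "城市": "CityNM",
    "经度": "Long",
    "纬度": "Lat",
}


def rename_station_fields(rows):
    """Return copies of the row dicts with Chinese keys replaced by ASCII keys.

    Keys that are not in FIELD_NAMES (for example the [note] field) are kept.
    """
    return [{FIELD_NAMES.get(k, k): v for k, v in row.items()} for row in rows]


# Hypothetical sample row; the values are made up for illustration only.
sample = (
    "监测点编码,监测点名称,城市,经度,纬度\n"
    "X001,Example Station,Example City,116.0,40.0\n"
)
stations = rename_station_fields(csv.DictReader(io.StringIO(sample)))
print(stations[0]["StationID"], stations[0]["Long"], stations[0]["Lat"])
```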

OBSERVATION FILES:

The observation files are organized so that each TYPE of observation has one row for each HOUR recorded.   Each weekly aggregate observation file therefore begins with rows like the following example, in which the first three columns give the date, hour, and observation type, and the fourth column holds the values for the first station:

date	hour	type	1001A
20160101	0	AQI	245
20160101	0	PM2.5	195
20160101	0	PM2.5_24h	67
20160101	0	PM10	200
20160101	0	PM10_24h	148
20160101	0	SO2	26
20160101	0	SO2_24h	17
20160101	0	NO2	95
20160101	0	NO2_24h	71
20160101	0	O3	10
20160101	0	O3_24h	29
20160101	0	O3_8h	11
20160101	0	O3_8h_24h	28
20160101	0	CO	3.4
20160101	0	CO_24h	1.771

which means that at midnight (hour 0) on the first day of 2016, the PM2.5 value for stationID 1001A was 195.
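
The same lookup can be done in code. This is a minimal sketch, assuming tab-separated text as displayed in this README (the delimiter in the distributed .csv files may differ, so it is a parameter). The function name lookup is a hypothetical helper, and the sample contains only the first three data rows of the example above.

```python
import csv
import io

# First rows of the example above, tab-separated as shown in this README.
SAMPLE = (
    "date\thour\ttype\t1001A\n"
    "20160101\t0\tAQI\t245\n"
    "20160101\t0\tPM2.5\t195\n"
    "20160101\t0\tPM2.5_24h\t67\n"
)


def lookup(text, date, hour, obs_type, station_id, delimiter="\t"):
    """Return the reading for one date/hour/type/station, or None if absent."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    for row in reader:
        if (row["date"], row["hour"], row["type"]) == (date, str(hour), obs_type):
            value = row.get(station_id)
            # Treat a missing column or an empty cell as "no reading".
            return float(value) if value else None
    return None


pm25 = lookup(SAMPLE, "20160101", 0, "PM2.5", "1001A")
print(pm25)  # 195.0
```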

There are no metadata or units of measure provided with the original dataset.  We are assuming that these readings use the same units as those reported by AQICN and PM25S:

FIELD_NAME	DESC_UNITS
AQI	Calculated AQI value for the station and hour
PM2.5	Particulate matter, 2.5 micron diameter, µg/m3 (micrograms per cubic metre)
PM2.5_24h	Average of the PM2.5 observations for the past 24 hours at time of reporting
PM10	Particulate matter, 10 micron diameter, µg/m3 (micrograms per cubic metre)
PM10_24h	Average of the PM10 observations for the past 24 hours at time of reporting
SO2	Sulfur dioxide (SOx), pphm (parts per hundred million)
SO2_24h	Average of the SO2 observations for the past 24 hours at time of reporting
NO2	Nitrogen dioxide (NOx), pphm (parts per hundred million)
NO2_24h	Average of the NO2 observations for the past 24 hours at time of reporting
O3	Ozone, pphm (parts per hundred million)
O3_24h	Average of the O3 observations for the past 24 hours at time of reporting
O3_8h	Average of the O3 observations for the past 8 hours at time of reporting
O3_8h_24h	Average of the O3 observations for the past 8 to 24 hours at time of reporting
CO	Carbon monoxide, ppm (parts per million)
CO_24h	Average of the CO observations for the past 24 hours at time of reporting
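
The type names follow a simple pattern: a pollutant name, optionally followed by an averaging-window suffix. The sketch below splits a type name on that pattern and attaches the unit listed above. The units are the assumptions stated in this README, not confirmed metadata, and the names ASSUMED_UNITS and describe_type are hypothetical.

```python
# Units as ASSUMED in this README (the original dataset ships without units).
ASSUMED_UNITS = {
    "AQI": None,  # index value, no unit
    "PM2.5": "µg/m3",
    "PM10": "µg/m3",
    "SO2": "pphm",
    "NO2": "pphm",
    "O3": "pphm",
    "CO": "ppm",
}


def describe_type(obs_type):
    """Split a type name into (pollutant, averaging window, assumed unit).

    The window is None for the plain hourly reading, e.g. "PM2.5".
    """
    pollutant, _, window = obs_type.partition("_")
    return pollutant, (window or None), ASSUMED_UNITS.get(pollutant)


for name in ("PM2.5", "PM2.5_24h", "O3_8h_24h", "AQI"):
    print(name, describe_type(name))
```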

The aggregated weekly file then extends for 1496 more columns, one for each remaining stationID.    Each new hour adds the same set of rows (one per observation type), with values filled in from left to right across the station columns.
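
Because every station is a column, it is often easier to work with one record per station and reading. The following is a minimal sketch of that wide-to-long reshaping, assuming tab-separated text and assuming that an empty cell means no reading was reported. The second station column "X002" and its values are hypothetical, added only to show more than one station.

```python
import csv
import io

# Column 1001A is from the example in this README; column X002 is made up.
WIDE_SAMPLE = (
    "date\thour\ttype\t1001A\tX002\n"
    "20160101\t0\tAQI\t245\t\n"
    "20160101\t0\tPM2.5\t195\t180\n"
)


def wide_to_long(text, delimiter="\t"):
    """Yield (date, hour, type, station_id, value) for every non-empty cell."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    header = next(reader)
    station_ids = header[3:]  # everything after date, hour, type
    for row in reader:
        if not row:
            continue  # skip blank lines
        date, hour, obs_type = row[0], int(row[1]), row[2]
        for station_id, cell in zip(station_ids, row[3:]):
            if cell.strip() == "":
                continue  # assumed to mean "no reading"
            yield (date, hour, obs_type, station_id, float(cell))


records = list(wide_to_long(WIDE_SAMPLE))
for record in records:
    print(record)
```

The long records can then be joined to the station table on the station ID to attach coordinates.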

The files are saved with file names that indicate the first day of the week, and each file contains the aggregated hourly observations for that week.

Please refer to the URL above.   We are unable to confirm or validate anything else about this data, but we note that it matches the values reported by AQICN.ORG and PM25S.COM.   In addition, the BerkeleyEarth data matches this unusual data format EXACTLY.

